A Diversity-Promoting Objective Function for Neural Conversation Models

نویسندگان

  • Jiwei Li
  • Michel Galley
  • Chris Brockett
  • Jianfeng Gao
  • William B. Dolan
چکیده

Sequence-to-sequence neural network models for generation of conversational responses tend to generate safe, commonplace responses (e.g., I don’t know) regardless of the input. We suggest that the traditional objective function, i.e., the likelihood of output (response) given input (message) is unsuited to response generation tasks. Instead we propose using Maximum Mutual Information (MMI) as the objective function in neural models. Experimental results demonstrate that the proposed MMI models produce more diverse, interesting, and appropriate responses, yielding substantive gains in BLEU scores on two conversational datasets and in human evaluations.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Exploring Personalized Neural Conversational Models

Modeling dialog systems is currently one of the most active problems in Natural Language Processing. Recent advances in Deep Learning have sparked an interest in the use of neural networks in modeling language, particularly for personalized conversational agents that can retain contextual information during dialog exchanges. This work carefully explores and compares several of the recently prop...

متن کامل

The efficiency of Artificial Neural Network, Neuro-Fuzzy and Multivariate Regression models for runoff and erosion simulation using rainfall simulator

1- INTRODUCTION According to the complexity of environmental factors related to erosion and runoff, correct simulation of these variables find importance under rain intensity domain of watershed areas.  Although modeling of erosion and runoff by Artificial Neural Network and Neuro-Fuzzy based on rainfall-runoff and discharge-sediment models were widely applied by researchers, scrutinizing Arti...

متن کامل

A Hybrid Optimization Algorithm for Learning Deep Models

Deep learning is one of the subsets of machine learning that is widely used in Artificial Intelligence (AI) field such as natural language processing and machine vision. The learning algorithms require optimization in multiple aspects. Generally, model-based inferences need to solve an optimized problem. In deep learning, the most important problem that can be solved by optimization is neural n...

متن کامل

Modeling and Multi-Objective Optimization of Stall Control on NACA0015 Airfoil with a Synthetic Jet using GMDH Type Neural Networks and Genetic Algorithms

This study concerns numerical simulation, modeling and optimization of aerodynamic stall control using a synthetic jet actuator. Thenumerical simulation was carried out by a large-eddy simulation that employs a RNG-based model as the subgrid-scale model. The flow around a NACA0015 airfoil, including a synthetic jet located at 10 % of the chord, is studied under Reynolds number Re = 12.7 × 106 a...

متن کامل

An Attentional Neural Conversation Model with Improved Specificity

In this paper we propose a neural conversation model for conducting dialogues. We demonstrate the use of this model to generate help desk responses, where users are asking questions about PC applications. Our model is distinguished by two characteristics. First, it models intention across turns with a recurrent network, and incorporates an attention model that is conditioned on the representati...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2016